Within-Project Defect Prediction of Infrastructure-as-Code Using Product and Process Metrics

نویسندگان

چکیده

Infrastructure-as-code (IaC) is the DevOps practice enabling management and provisioning of infrastructure through definition machine-readable files, hereinafter referred to as IaC scripts . Similarly other source code artefacts, these files may contain defects that can preclude their correct functioning. In this paper, we aim at assessing role xmlns:xlink="http://www.w3.org/1999/xlink">product xmlns:xlink="http://www.w3.org/1999/xlink">process metrics when predicting defective IaC scripts. We propose a fully integrated machine-learning framework for Defect Prediction, allows repository crawling, collection, model building, evaluation. To evaluate it, analyzed 104 projects employed five classifiers compare performance in flagging suspicious The key results study report Random Forest best-performing model, with median AUC-PR 0.93 MCC 0.80. Furthermore, least collected projects, product identify scripts more accurately than process metrics. Our findings put baseline investigating Prediction relationship between metrics, scripts’ quality.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Software defect prediction using static code metrics : formulating a methodology

Software defect prediction is motivated by the huge costs incurred as a result of software failures. In an effort to reduce these costs, researchers have been utilising software metrics to try and build predictive models capable of locating the most defect-prone parts of a system. These areas can then be subject to some form of further analysis, such as a manual code review. It is hoped that su...

متن کامل

Using Source Code and Process Metrics for Defect Prediction - A Case Study of Three Algorithms and Dimensionality Reduction

Software defect prediction is very important in helping the software development team allocate test resource efficiently and better understand the root cause of defects. Furthermore, it can help find the reason why a project is failure-prone. This paper applies binary classification in predicting if a software component has a bug by using three widely used algorithms in machine learning: Random...

متن کامل

Software product metrics used to build defect prediction models

This document presents software product metrics we usually use to build software defect prediction models. 1 Software product metrics This document presents software product metrics we usually use to build software defect prediction models, especially those built by Madeyski and Jureczko [6, 5, 4]. • The metrics suite suggested by Chidamber and Kemerer [2]: Weighted Methods per Class (WMC). The...

متن کامل

Benchmarking cross-project defect prediction approaches with costs metrics

Defect prediction can be a powerful tool to guide the use of quality assurance resources. In recent years, many researchers focused on the problem of Cross-Project Defect Prediction (CPDP), i.e., the creation of prediction models based on training data from other projects. However, only few of the published papers evaluate the cost efficiency of predictions, i.e., if they save costs if they are...

متن کامل

A review of process metrics in defect prediction studies

Process metrics appear to be an effective addition to software defect prediction models usually built upon product metrics. We present a review of research studies that investigate process metrics in defect prediction. The following process metrics are discussed: Number of Revisions, Number of Distinct Committers, Number of Modified Lines, Is New and Number of Defects in Previous Revision. We n...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: IEEE Transactions on Software Engineering

سال: 2022

ISSN: ['0098-5589', '1939-3520', '2326-3881']

DOI: https://doi.org/10.1109/tse.2021.3051492